Introduction

The capacity of the Gaussian channel without feedback, subject to a generalized energy constraint, is determined in [1]. In that work, the constraint is given in terms of the covariance of the channel noise process. However, there are many situations where one may wish to determine capacity subject to a constraint determined by a covariance that is different from that of the channel noise. Examples are jamming or countermeasures situations, or when there is insufficient knowledge of the natural environment.

Channels where the covariance of the noise is the same as that of the constraint will be called matched channels; otherwise, we say that the channel is mismatched (to the constraint). In this paper, the capacity of the mismatched Gaussian channel is determined. Results for a restricted class of mismatched channels are given elsewhere [2]. Various special cases of the mismatched channel have been treated previously [3] - [5].

The results for the mismatched channel differ significantly from those for the matched channel. A discussion of these differences follows the proof of the main result.
Definitions and Structure

The channel is defined as in [1]. $H_1$ and $H_2$ are real separable Hilbert spaces with Borel $\sigma$-fields $\mathcal{B}_1$ and $\mathcal{B}_2$ and inner products $\langle\cdot,\cdot\rangle_1$ and $\langle\cdot,\cdot\rangle_2$. The message process $X$ in $H_1$ is represented by a probability $\mu_X$ on $(H_1,\mathcal{B}_1)$. The message is encoded into the transmitted signal $A(X)$ in $H_2$ by a $\mathcal{B}_1/\mathcal{B}_2$-measurable coding function $A$. To each sample function of the signal process, the channel adds a sample function from the noise process $N$, represented by a Gaussian measure $\mu_N$ on $(H_2,\mathcal{B}_2)$. The received signal (channel output) is then a sample function from the process $Y = A(X) + N$, represented by the measure $\mu_Y$. As usual $X$ and $N$ are assumed independent, so that $\mu_Y(B) = \mu_X\otimes\mu_N\{(x,y): A(x) + y \in B\}$, where $\mu_X\otimes\mu_N$ is product measure. The channel probability $\mu_{XY}$, which has marginal measures $\mu_X$ and $\mu_Y$, is a measure on $(H_1\times H_2, \mathcal{B}_1\times\mathcal{B}_2)$ defined by $\mu_{XY}(C) = \mu_X\otimes\mu_N\{(x,y): (x, A(x)+y) \in C\}$. The average mutual information is then $I[\mu_{XY}]$, where $I[\mu_{XY}] = \infty$ if it is false that $\mu_{XY}$ is absolutely continuous with respect to $\mu_X\otimes\mu_Y$ ($\mu_{XY} \ll \mu_X\otimes\mu_Y$), and otherwise

$$I[\mu_{XY}] = \int_{H_1\times H_2} \log\left[\frac{d\mu_{XY}}{d(\mu_X\otimes\mu_Y)}\right] d\mu_{XY}. \tag{1}$$

The capacity is the supremum of $I[\mu_{XY}]$ over all admissible pairs $(\mu_X, A)$.

For this paper, a covariance operator in a Hilbert space will be defined to be a symmetric, non-negative, and trace-class bounded linear operator. The constraint on the transmitted signal process $A(X)$ will be given in terms of a covariance operator $R_W$ in $H_2$; as is well-known, to every such covariance operator there corresponds a zero-mean Gaussian measure on $(H_2, \mathcal{B}_2)$.

When $H_2 = L_2[0,T]$ and $R$ is a covariance operator, $R$ can be represented as an integral operator with kernel $R$ which is a covariance function. There is then a well-known isomorphism between $\mathrm{range}(R^{1/2})$ and the reproducing kernel Hilbert space of $R$. All measures considered here will be assumed WLOG to have zero mean. The capacity will be determined under the following assumptions:
(A-1) $R_N = R_W^{1/2}(I + S)R_W^{1/2}$, where $R_N$ is the covariance operator of the noise measure $\mu_N$, and $S$ is a bounded linear operator with pure point spectrum that does not have $-1$ as a limit point of its point spectrum;

(A-2) The admissible set $Q$ is the set of all $(\mu_X, A)$ such that

$$\int_{H_1} \|R_W^{-1/2}A(x)\|_2^2\, d\mu_X(x) \le P,$$

where $P > 0$ is fixed.

It will be assumed WLOG [1] that $\overline{\mathrm{range}(R_N)} = H_2$, so that $R_N^{-1/2}$ exists. Assumption (A-1) then implies that $R_W^{-1/2}$ exists; in fact, that $\mathrm{range}(R_W^{1/2}) = \mathrm{range}(R_N^{1/2})$. Thus, there exists a unitary operator $U$ in $H_2$ such that $R_N^{1/2} = R_W^{1/2}(I + S)^{1/2}U^*$, where $U^*$ is the adjoint of $U$.

The class of all zero-mean Gaussian measures $\mu_N$ with covariance operator as in (A-1) includes all those that are mutually absolutely continuous with respect to $\mu_W$, where $\mu_W$ is zero-mean Gaussian with covariance $R_W$ [6].

From the results of [1], one can limit attention to cases where $\mu_{A(X)}$ is Gaussian with covariance operator

$$R_{A(X)} = \sum_n \tau_n\, [R_N^{1/2}u_n]\otimes[R_N^{1/2}u_n] \tag{2}$$

where $\tau_n \ge 0$ for $n \ge 1$, $\sum_n \tau_n < \infty$, $\{u_n, n \ge 1\}$ is a c.o.n. set and $(u\otimes v)x = \langle v,x\rangle_2\, u$.

When $\mu_{A(X)}$ has (2) for covariance and is Gaussian, then [1]

$$I[\mu_{XY}] = (1/2)\sum_n \log[1 + \tau_n]. \tag{3}$$


Moreover,

$$\int_{H_1} \|R_W^{-1/2}A(x)\|_2^2\, d\mu_X(x) = \mathrm{Trace}\; R_W^{-1/2}R_{A(X)}R_W^{-1/2} = \sum_n \tau_n \|(I+S)^{1/2}U^*u_n\|_2^2. \tag{4}$$

Defining $x_n^2 = \tau_n\|(I+S)^{1/2}U^*u_n\|_2^2$, the capacity problem is thus reduced to maximizing

$$(1/2)\sum_n \log[1 + x_n^2(1+\gamma_n)^{-1}] \tag{5}$$

over all sequences $(x_n)$ and c.o.n. sets $\{v_n, n \ge 1\}$ such that $\sum_n x_n^2 \le P$, where $\gamma_n = \langle Sv_n, v_n\rangle_2$, $n \ge 1$.

The supremum of (5) subject to the stated constraint is the capacity and will be denoted as $C_W(P)$; the capacity for the matched channel ($R_W = R_N$) will be denoted by $C_N(P)$.




In the case where $H_2$ is infinite-dimensional, $\theta$ will denote lim inf of the eigenvalues of the operator $S$ satisfying $R_N = R_W^{1/2}(I+S)R_W^{1/2}$. By assumption (A-1), $\theta > -1$. $\{\lambda_n, n \ge 1\}$ will denote the eigenvalues of $S$ that are strictly less than $\theta$; of course, this set can be empty. Similarly, $\{e_n, n \ge 1\}$ will always denote an o.n. set of $H_2$ elements that are eigenvectors of $S$ corresponding to the eigenvalues $\{\lambda_n, n \ge 1\}$: $Se_n = \lambda_n e_n$, $n \ge 1$.


Preliminary Results

Lemma 1: Let $(\gamma_n)$, $n \le M$, be any non-decreasing sequence of strictly positive real numbers. Let $(x_n)$ be any sequence of $M$ real numbers. Fix $P>0$, and define

$$g(M,P,\gamma) = \sup_{\{x:\ \sum_1^M x_n^2 \le P\}}\ \prod_{n=1}^M (\gamma_n + x_n^2)/\gamma_n.$$

Then

$$g(M,P,\gamma) = \prod_{n=1}^K \Big(\sum_{i=1}^K \gamma_i + P\Big)\Big/(K\gamma_n),$$

where $K \le M$ is the largest integer such that $\sum_1^K \gamma_i + P \ge K\gamma_K$. $g(M,P,\gamma)$ is uniquely attained by $(x_n^2)$ such that

$$x_n^2 = \Big(\sum_{i=1}^K \gamma_i + P\Big)\Big/K - \gamma_n,\quad n \le K; \qquad x_n^2 = 0,\quad n > K.$$


Proof: Define $f_M: \mathbb{R}^M \to \mathbb{R}$ by $f_M(y) = \sum_1^M \log[1 + y_n\gamma_n^{-1}]$. We seek to maximize $f_M$ subject to the constraints

$$g(y) = \sum_1^M y_n - P \le 0,$$

$$h_i(y) = -y_i \le 0,\quad i = 1,\dots,M.$$

This is a constrained optimization problem with objective function $f_M$ which is concave over the convex set $\{z \in \mathbb{R}^M: z_i \ge 0,\ i = 1,\dots,M\}$. Moreover, each constraint function is linear. Thus, any solution to this problem will define a unique global maximum for $f_M$ [7]. In order that $y^*$ be a solution, it is necessary and sufficient that the following set of equations be satisfied [7]:

$$\frac{1}{y_i^* + \gamma_i} + \beta - \alpha_i = 0,\quad i = 1,\dots,M \tag{6}$$

$$\sum_1^M y_i^* - P \le 0,\qquad \beta\Big[\sum_1^M y_i^* - P\Big] = 0 \tag{7}$$

$$-y_i^* \le 0,\qquad \alpha_i y_i^* = 0,\quad i = 1,\dots,M \tag{8}$$

for some set of non-positive real numbers $\{\beta, \alpha_1, \dots, \alpha_M\}$.

We first attempt to obtain a solution by setting $\alpha_1 = \alpha_2 = \dots = \alpha_M = 0$. This requires $\beta(y_i^* + \gamma_i) = -1$ for $i \le M$; thus, $\sum_1^M y_i^* + \sum_1^M \gamma_i = -M\beta^{-1}$, and so

$$y_n^* = \Big(\sum_1^M y_i^* + \sum_1^M \gamma_i\Big)\Big/M - \gamma_n$$

for $n = 1, 2, \dots, M$. This definition of $y^*$ and the constraints (8) require that

$$\sum_1^M y_i^* + \sum_1^M \gamma_i \ge M\gamma_n$$

for $n \le M$; this inequality is satisfied for all $n \le M$ if and only if it is satisfied for $n = M$. Also, $\beta = -(y_i^* + \gamma_i)^{-1}$ for $i \le M$ implies $\beta < 0$, so that $\sum_1^M y_i^* = P$ by constraints (7). Thus, if $P + \sum_1^M \gamma_i \ge M\gamma_M$, an optimum solution is given by

$$y_i^* = \Big(P + \sum_1^M \gamma_n - M\gamma_i\Big)\Big/M,\quad i \le M.$$

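If there exists $K<M$ such that $K\gamma_K \le P + \sum_1^K \gamma_i < K\gamma_{K+1}$, then constraints (6)-(8) are satisfied by choosing $\beta = -K[P + \sum_1^K \gamma_i]^{-1}$, $\alpha_1 = \alpha_2 = \dots = \alpha_K = 0$, and

$$y_i^* = K^{-1}\Big[P + \sum_1^K \gamma_n - K\gamma_i\Big],\quad i \le K; \qquad y_i^* = 0,\quad i > K,$$

so that $\sum_1^K y_i^* = P$, together with

$$\alpha_i = -K\Big[P + \sum_1^K \gamma_n\Big]^{-1} + \gamma_i^{-1},\quad i > K.$$

Thus,

$$\sup_{\{x:\ \sum_1^M x_n^2 \le P\}}\ \prod_{n=1}^M (\gamma_n + x_n^2)/\gamma_n = \prod_{n=1}^K \Big(\sum_{i=1}^K \gamma_i + P\Big)\Big/(K\gamma_n),$$

where $K \le M$ is the largest integer such that $\sum_1^K \gamma_i + P \ge K\gamma_K$. The supremum is attained by $y^*$ as defined above, or for

$$x_n^2 = \Big[P + \sum_1^K \gamma_i\Big]\Big/K - \gamma_n,\quad n \le K; \qquad x_n^2 = 0,\quad n > K.$$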
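The allocation in Lemma 1 can be sketched numerically. The following is a minimal illustration, assuming the sequence is supplied as a Python list in non-decreasing order; the function name `water_fill` and the example values are hypothetical and do not appear in the paper.

```python
import math

def water_fill(gammas, P):
    """Illustrative sketch of Lemma 1 (names are hypothetical).

    gammas: non-decreasing list of strictly positive floats (gamma_n).
    P: total energy, P > 0.
    Returns (K, x_sq, g) where x_sq[n] is the optimal x_n^2 and g is the
    value of the product in Lemma 1.
    """
    K, partial = 0, 0.0
    for k, gam in enumerate(gammas, start=1):
        partial += gam
        # Condition of Lemma 1: sum_{i<=k} gamma_i + P >= k * gamma_k.
        # Once it fails for some k it fails for all larger k, so we can stop.
        if partial + P >= k * gam:
            K = k
        else:
            break
    level = (sum(gammas[:K]) + P) / K  # common value of gamma_n + x_n^2, n <= K
    x_sq = [level - gam if n < K else 0.0 for n, gam in enumerate(gammas)]
    g = math.prod(level / gam for gam in gammas[:K])
    return K, x_sq, g

K, x_sq, g = water_fill([1.0, 2.0, 10.0], 3.0)
print(K, x_sq, g)
```

In this example only the first two components receive energy, and the allocated energies sum to $P$, as constraint (7) requires when $\beta < 0$.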


□

Lemma 2: Let $(\lambda_i)$, $1 \le i \le K$, be a non-decreasing sequence of strictly positive real numbers. Suppose that $(\gamma_n)$ is a non-decreasing sequence such that $\sum_1^J \gamma_i \ge \sum_1^J \lambda_i$ for all $J \le K$, and let $P > 0$ be such that $\sum_1^K \gamma_i + P \ge K\gamma_K$. Define

$$f_K(\gamma) = \prod_{n=1}^K \Big(P + \sum_{i=1}^K \gamma_i\Big)\Big/(K\gamma_n).$$

Then $f_K(\gamma) \le f_K(\lambda)$, with equality if and only if $\gamma_i = \lambda_i$ for all $i \le K$.


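Proof: For any fixed $n$, $\partial f_K(\gamma)/\partial\gamma_n$ is negative, using $\sum_1^K \gamma_i + P \ge K\gamma_K$. Thus $f_K(\gamma)$ increases for $\gamma_n$ decreasing. One can now assume that $\sum_1^K \gamma_n = \sum_1^K \lambda_n$. To see this, suppose $\sum_1^K \gamma_n > \sum_1^K \lambda_n$. First assume that there exists $p \le K$ such that $\gamma_p > \gamma_{p-1}$ and $\gamma_p > \lambda_p$. Define a sequence $(\gamma_n')$ by $\gamma_n' = \gamma_n$ if $n \ne p$, $\gamma_p' = \gamma_p - \epsilon$, with $\epsilon > 0$ given as a minimum of several differences [the explicit expression for $\epsilon$ is illegible in the source]. Continuing to form new sequences in this manner, one will eventually obtain a non-decreasing sequence $(\gamma_n')$ with $\sum_1^J \gamma_n' \ge \sum_1^J \lambda_n$ for all $J \le K$, and either $\sum_1^K \gamma_n' = \sum_1^K \lambda_n$ or else $\gamma_1' = \gamma_2' = \dots = \gamma_p'$, where $p$ is the largest integer $i$ such that $\gamma_i' > \lambda_i$.

If the latter case holds, define a new sequence $(\gamma_n'')$, with $\gamma_n'' = \gamma_n'$ for $2 \le n \le K$, while $\gamma_1'' = \gamma_1' - \epsilon$, $\epsilon = \min(\gamma_1' - \lambda_1, \sum_1^K(\gamma_i' - \lambda_i))$. $(\gamma_n'')$ is non-decreasing and $\sum_1^J \gamma_n'' \ge \sum_1^J \lambda_n$. If $\epsilon = \gamma_1' - \lambda_1$, the procedure is repeated for $(\gamma_n'')$ and $\gamma_2''$; if $\epsilon = \sum_1^K(\gamma_i' - \lambda_i)$, the procedure is repeated for $(\gamma_n'')$ and $\gamma_1''$. Continuing in this manner will eventually produce a sequence $(\gamma_n'')$ such that $\sum_1^K \gamma_n'' = \sum_1^K \lambda_n$.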
Assume then that $\sum_1^K \gamma_n = \sum_1^K \lambda_n$. If $\gamma$ and $\lambda$ are not identical, let $p \le K$ be the largest integer such that $\gamma_p \ne \lambda_p$; since $\sum_1^{p-1} \gamma_n \ge \sum_1^{p-1} \lambda_n$ and $\sum_1^p \gamma_n = \sum_1^p \lambda_n$, $\gamma_p < \lambda_p$. Let $t < p$ be the largest integer such that $\gamma_t > \lambda_t$; such $t$ must exist. Define a new sequence $(\gamma_n')$ by $\gamma_n' = \gamma_n$ if $n \ne t$, $n \ne p$, while $\gamma_p' = \gamma_p + \epsilon$, $\gamma_t' = \gamma_t - \epsilon$, $\epsilon := \inf(\lambda_p - \gamma_p, \gamma_t - \lambda_t)$. $f_K(\gamma) < f_K(\gamma')$, since $(\gamma_t - \epsilon)(\gamma_p + \epsilon) = \gamma_t\gamma_p - \epsilon(\gamma_p - \gamma_t) - \epsilon^2$, and $\gamma_p \ge \gamma_t$. This procedure is successively repeated; it will terminate when and only when one obtains a sequence $(\gamma_n')$ such that $\gamma_n' = \lambda_n$ for all $n \le K$.

□
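Lemma 2 can be spot-checked on a small example. The sketch below is only an illustration under assumed sample values (the sequences `lam` and `gam`, the value of `P`, and the helper names are hypothetical); it verifies the hypotheses of the lemma and then compares $f_K(\gamma)$ with $f_K(\lambda)$.

```python
import math
from itertools import accumulate

def f_K(seq, P):
    """f_K from Lemma 2: prod_n (P + sum_i seq_i) / (K * seq_n)."""
    K = len(seq)
    level = (P + sum(seq)) / K
    return math.prod(level / s for s in seq)

def satisfies_hypotheses(gam, lam, P):
    """Check the assumptions of Lemma 2 for two equal-length sequences."""
    non_decreasing = all(a <= b for a, b in zip(gam, gam[1:]))
    partial_sums_dominate = all(
        g >= l for g, l in zip(accumulate(gam), accumulate(lam))
    )
    level_ok = sum(gam) + P >= len(gam) * gam[-1]
    return non_decreasing and partial_sums_dominate and level_ok

lam = [0.5, 1.0, 2.0]   # hypothetical lambda sequence
gam = [0.8, 1.2, 2.0]   # hypothetical gamma sequence; partial sums dominate lam
P = 4.0

print(satisfies_hypotheses(gam, lam, P), f_K(gam, P), f_K(lam, P))
```

A single numerical instance does not replace the proof; it only shows the direction of the inequality for one admissible pair of sequences.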


Main Results

Theorem 1:

(a) Suppose that $H_2$ has dimension $M<\infty$. The capacity is then

$$C_W(P) = (1/2)\sum_{n=1}^K \log\left[\frac{\sum_1^K \beta_i + P + K}{K(1 + \beta_n)}\right]$$

where $\beta_1 \le \beta_2 \le \dots \le \beta_M$ are the eigenvalues of $S$, and $K$ is the largest integer $\le M$ such that $\sum_1^K \beta_i + P \ge K\beta_K$. The capacity is attained by a Gaussian $\mu_{A(X)}$ with covariance operator (2), where $u_n = Ue_n$ and

$$\tau_n = \frac{\sum_1^K \beta_i + P - K\beta_n}{K(1 + \beta_n)}$$

for $n \le K$, $\tau_n = 0$ for $n > K$, and $\{e_n, n \ge 1\}$ are o.n. eigenvectors of $S$ corresponding to the eigenvalues $(\beta_n)$. No other Gaussian $\mu_{A(X)}$ can attain capacity. The same result is obtained if $H_2$ has dimension $L < \infty$ and $\mu_{A(X)}$ is constrained to have support of dimension $M < L$.
(b) Suppose that $H_2$ is infinite-dimensional and that support $(\mu_{A(X)})$ is restricted to have dimension $\le M < \infty$.

(i) If $\{\lambda_n, n \ge 1\}$ is empty, then $C_W(P) = (M/2)\log[1 + PM^{-1}(1+\theta)^{-1}]$. Capacity can be attained if and only if $S$ has $\theta$ as an eigenvalue of multiplicity $\ge M$. In this case $C_W(P)$ is attained by a Gaussian $\mu_{A(X)}$ with covariance (2), where $u_i = Ue_i$ and $\tau_i = PM^{-1}(1+\theta)^{-1}$ for $i \le M$, with $\{e_1, \dots, e_M\}$ any o.n. set in the null space of $S - \theta I$.

(ii) If $K\lambda_K \le \sum_1^K \lambda_i + P < K\lambda_{K+1}$ for some $K<M$, then the capacity is as in (a), with $\beta_i = \lambda_i$, $i = 1,\dots,K$, and can be similarly attained.

(iii) Let $K = \min(L,M)$, where $L \ge 1$ is the number of eigenvalues of $S$ whose value is strictly less than $\theta$, and suppose that $P + \sum_1^K \lambda_i \ge K\lambda_K$. The capacity is then

$$C_W(P) = \frac{1}{2}\sum_{n=1}^K \log\left[\frac{1+\theta}{1+\lambda_n}\right] + \frac{M}{2}\log\left[1 + \frac{P + \sum_{i=1}^K(\lambda_i - \theta)}{M(1+\theta)}\right].$$

The capacity can be attained if and only if $\theta$ is an eigenvalue of $S$ with multiplicity $\ge M-K$. The capacity is then achieved by a Gaussian $\mu_{A(X)}$ with covariance (2), where $u_n = Ue_n$ and $\tau_n = (\sum_1^K \lambda_i + P - K\theta - M(\lambda_n - \theta))(1 + \lambda_n)^{-1}M^{-1}$ for $n \le K$, with $Se_n = \lambda_n e_n$ and $\{e_1, \dots, e_K\}$ an o.n. set; and with $u_n = Uv_n$ and $\tau_n = (P + \sum_1^K \lambda_i - K\theta)M^{-1}(1+\theta)^{-1}$ for $K+1 \le n \le M$, where $Sv_n = \theta v_n$ and $\{v_{K+1}, \dots, v_M\}$ is an o.n. set. The sets $\{u_1, \dots, u_K\}$ and $\{\tau_1, \dots, \tau_M\}$ are uniquely defined for any maximizing Gaussian $\mu_{A(X)}$.

Proof: (a). From (5),

$$C_W(P) = \sup\ \frac{1}{2}\sum_{n=1}^M \log[1 + x_n^2\gamma_n^{-1}],$$

where $\gamma_n = 1 + \langle Sv_n, v_n\rangle_2$, $\{v_n, n \le M\}$ is a c.o.n. set, and the supremum is over all such c.o.n. sets and all $(x_n)$ such that $\sum_1^M x_n^2 \le P$. Since $\beta_1 \le \beta_2 \le \dots \le \beta_M$ are the non-decreasing eigenvalues of $S$,

$$\sum_{n=1}^J [1 + \langle Sv_n, v_n\rangle_2] \ge \sum_{n=1}^J [1 + \beta_n]$$

for all $J \le M$ and any fixed c.o.n. set $\{v_n, n \le M\}$. The expression of $C_W(P)$ in (a) and the unique covariance of the maximizing Gaussian $\mu_{A(X)}$ both now follow from Lemma 1 and Lemma 2. The same result holds if $\dim(H_2) = L < \infty$ and $\dim[\mathrm{supp}(\mu_{A(X)})] \le M < L$, since in this case $S$ again has $M$ smallest eigenvalues.

(b)-(i). If $S > \theta I$, then $S - \theta I$ does not have $M$ smallest eigenvalues. However, given any $\epsilon>0$, one can find eigenvalues $\gamma_1^\epsilon, \dots, \gamma_M^\epsilon$ such that $\theta < \gamma_i^\epsilon < \theta+\epsilon$ for $i \le M$. Using this in (3) one obtains

$$I[\mu_{XY}] = (1/2)\sum_{n=1}^M \log[1 + \tau_n] = (1/2)\sum_{n=1}^M \log[1 + x_n^2(1 + \gamma_n^\epsilon)^{-1}] \ge (1/2)\sum_{n=1}^M \log[1 + x_n^2(1 + \theta + \epsilon)^{-1}].$$

The expression on the right of the inequality is maximized, over all $(x_n)$ such that $\sum_1^M x_n^2 \le P$, by defining $x_n^2 = P/M$, $n \le M$. Thus, $C_W(P) \ge (1/2)\sum_1^M \log[1 + (1 + \theta + \epsilon)^{-1}P/M]$ for all $\epsilon>0$, and so $C_W(P) \ge (M/2)\log[1 + PM^{-1}(1 + \theta)^{-1}]$. For the reverse inequality, one notes that under the constraint $E_{\mu_X}\|R_N^{-1/2}A(X)\|_2^2 \le P$, it is shown in [1] that $C_N(P) = (M/2)\log(1 + P/M)$. For $S \ge \theta I$, $\|R_N^{-1/2}A(X)\|_2^2 \le (1+\theta)^{-1}\|R_W^{-1/2}A(X)\|_2^2$. Thus, $E_{\mu_X}\|R_W^{-1/2}A(X)\|_2^2 \le P$ implies $E_{\mu_X}\|R_N^{-1/2}A(X)\|_2^2 \le (1+\theta)^{-1}P$, giving $C_W(P) \le (M/2)\log[1 + P/(M(1+\theta))]$, so that $C_W(P) = (M/2)\log[1 + PM^{-1}(1+\theta)^{-1}]$.

If $S \ge \theta I$, with $\theta$ an eigenvalue of multiplicity $K$, the above argument is modified in an obvious way ($\gamma_i^\epsilon = \theta$ for $i = 1,\dots,\min(K,M)$) to again obtain $C_W(P) = (M/2)\log[1 + PM^{-1}(1+\theta)^{-1}]$.


To prove (b-ii), the proof of (a) is repeated after substituting $\lambda_i$ for $\beta_i$, $i \le M$.


Now suppose that $S$ has $K<M$ eigenvalues $\lambda_1 \le \dots \le \lambda_K$ strictly less than $\theta$, and that $\sum_1^K \lambda_i + P \ge K\lambda_K$. $C_W(P) = \sup_{(P_1, v)} C_W(P_1, v)$, where

$$C_W(P_1, v) = \sup\ \frac{1}{2}\sum_{n=1}^M \log[1 + x_n^2(1 + \langle Sv_n, v_n\rangle_2)^{-1}],$$

$v = \{v_n, n \le M\}$ is any o.n. set, $0 \le P_1 \le P$, and the supremum is over all $(x_n)$ such that $\sum_1^K x_n^2 \le P_1$, $\sum_1^M x_n^2 \le P$. Repeating the analysis of (a) and (b-i), one finds that

$$C_W(P_1, v) = (1/2)\sum_{n=1}^J \log\left[\frac{\sum_1^J \lambda_i + P_1 + J}{J(1 + \lambda_n)}\right] + (1/2)(M-K)\log\left[1 + \frac{P - P_1}{(M-K)(1+\theta)}\right]$$

where $J \le K$ is the largest integer such that $\sum_1^J \lambda_i + P_1 \ge J\lambda_J$. Since this result holds for any o.n. set $\{v_n, n \le M\}$, it remains only to determine the value of $P_1$ that maximizes $C_W(P_1, v)$ (a differentiable function of $P_1$ in $[0,P]$). Differentiating, one sees that $C_W(P_1, v)$ is increasing with $P_1$ so long as

$$P_1 < \Big[JP + (M-K)\Big(J\theta - \sum_1^J \lambda_i\Big)\Big](M-K+J)^{-1}.$$

Since $P_1 < J\lambda_{J+1} - \sum_1^J \lambda_i$, the preceding inequality is satisfied so long as $(M-K+J)\lambda_{J+1} - \sum_1^J \lambda_i \le P + (M-K)\theta$, and this is satisfied because $P + \sum_1^J \lambda_i \ge J\lambda_{J+1}$, $\lambda_{J+1} < \theta$. It follows that $C_W(P_1, v)$ is an increasing function of $P_1$ for $P_1 \le -\sum_1^K \lambda_i + K\lambda_K$. Assuming that $P_1 \ge -\sum_1^K \lambda_i + K\lambda_K$, the maximum of $C_W(P_1, v)$ is attained uniquely by

$$P_1 = M^{-1}\Big[KP - (M-K)\sum_1^K \lambda_i + (M-K)K\theta\Big].$$

Using this value of $P_1$ in the expression for $C_W(P_1, v)$, one obtains $C_W(P)$ as in (b-iii). The statement on attaining capacity follows from the results of (b-i) and (b-ii). □
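The finite-dimensional formula of Theorem 1(a) can be evaluated directly. The sketch below is a minimal illustration, assuming the eigenvalues of $S$ are given as a list of floats greater than $-1$; the function name `capacity_theorem_1a` and the sample eigenvalues are hypothetical.

```python
import math

def capacity_theorem_1a(betas, P):
    """Illustrative evaluation of Theorem 1(a) (hypothetical helper).

    betas: eigenvalues of S, each > -1; sorted internally.
    Returns (C_W(P), list of tau_n) in natural-log units.
    """
    betas = sorted(betas)
    K, partial = 0, 0.0
    for k, b in enumerate(betas, start=1):
        partial += b
        if partial + P >= k * b:   # sum_{i<=k} beta_i + P >= k * beta_k
            K = k
        else:
            break
    s = sum(betas[:K])
    cap = 0.5 * sum(math.log((s + P + K) / (K * (1 + b))) for b in betas[:K])
    taus = [(s + P - K * b) / (K * (1 + b)) if n < K else 0.0
            for n, b in enumerate(betas)]
    return cap, taus

# Matched channel (S = 0): should reduce to (M/2) log(1 + P/M).
cap_matched, _ = capacity_theorem_1a([0.0, 0.0], 2.0)
# A mismatched example with two negative eigenvalues.
cap_mis, taus = capacity_theorem_1a([-0.5, -0.1], 1.0)
print(cap_matched, cap_mis, taus)
```

Two consistency checks follow from (3) and (4): the returned capacity equals $(1/2)\sum_n \log[1+\tau_n]$, and $\sum_n \tau_n(1+\beta_n) = P$.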

Theorem 2: Suppose that $H_2$ is infinite-dimensional, and that $\dim[\mathrm{supp}(\mu_{A(X)})]$ is not constrained.

(a) If $\{\lambda_n, n \ge 1\}$ is not empty, and $\sum_n(\theta - \lambda_n) \le P$, then

$$C_W(P) = \frac{1}{2}\sum_n \log\left[\frac{1+\theta}{1+\lambda_n}\right] + \frac{1}{2}\,\frac{P + \sum_m(\lambda_m - \theta)}{1+\theta}.$$

(b) If $\{\lambda_n, n \ge 1\}$ is not empty, and $P < \sum_n(\theta - \lambda_n)$, then there exists a largest integer $K$ such that $\sum_1^K \lambda_i + P \ge K\lambda_K$, and

$$C_W(P) = \frac{1}{2}\sum_{n=1}^K \log\left[\frac{\sum_1^K \lambda_i + P + K}{K(1+\lambda_n)}\right].$$

(c) If $\{\lambda_n, n \ge 1\}$ is empty, then $C_W(P) = \dfrac{P}{2(1+\theta)}$.

(d) In (a), the capacity can be attained if and only if $\sum_n(\theta - \lambda_n) = P$. It is then attained by a Gaussian $\mu_{A(X)}$ with covariance operator as in (2), where $u_n = Ue_n$ and $\tau_n = (\theta - \lambda_n)(1+\lambda_n)^{-1}$ for all $n \ge 1$. In (b), the capacity can be attained by a Gaussian $\mu_{A(X)}$ with covariance operator (2), where $u_n = Ue_n$ and

$$\tau_n = \frac{\sum_1^K \lambda_i + P + K}{K(1+\lambda_n)} - 1$$

for $n \le K$; $\tau_n = 0$ for $n > K$. In (c), the capacity cannot be attained.
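The three capacity expressions of Theorem 2 can be collected in one routine. The sketch below is an illustration only; it assumes, beyond what the theorem requires, that only finitely many eigenvalues lie below $\theta$, so that all sums are finite. The function name `capacity_theorem_2` and the sample values are hypothetical.

```python
import math

def capacity_theorem_2(theta, lams, P):
    """Illustrative evaluation of Theorem 2 (a)-(c).

    Assumption for this sketch: `lams` is a finite list of the eigenvalues
    of S strictly below theta (the theorem also allows infinitely many).
    """
    lams = sorted(lams)
    if not lams:                                   # case (c)
        return P / (2 * (1 + theta))
    gap = sum(theta - l for l in lams)
    if gap <= P:                                   # case (a)
        log_part = sum(math.log((1 + theta) / (1 + l)) for l in lams)
        return 0.5 * log_part + 0.5 * (P - gap) / (1 + theta)
    K, partial = 0, 0.0                            # case (b)
    for k, l in enumerate(lams, start=1):
        partial += l
        if partial + P >= k * l:
            K = k
        else:
            break
    s = sum(lams[:K])
    return 0.5 * sum(math.log((s + P + K) / (K * (1 + l))) for l in lams[:K])

print(capacity_theorem_2(0.0, [], 2.0))       # case (c)
print(capacity_theorem_2(0.0, [-0.5], 1.0))   # case (a)
print(capacity_theorem_2(0.0, [-0.5], 0.25))  # case (b)
```

With a single eigenvalue below $\theta$, the expressions of cases (a) and (b) agree at the boundary $P = \theta - \lambda_1$, which gives a quick check that the branches are joined consistently.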

Remarks. Theorem 2 has been proved for the case $\theta=0$ in [2]. Here we shall omit the detailed proof of Theorem 2 above; given Lemmas 3-5 below, the proof of the Theorem in [2] can be applied in exactly the same way to prove the present Theorem 2.

Several errata in [2] should be mentioned here. The expression for $\tau_n$ ($n \le K$) in the first line of p. 9 of [2] should read:

$$\tau_n = (1 + \lambda_n)^{-1}\Big[\sum_1^K \lambda_i + P - K\lambda_n\Big]\Big/K.$$

The expression for $\tau_n$ on the next-to-last line of p. 8 should read: $\tau_n = -\lambda_n(1 + \lambda_n)^{-1}$ for $n \ge 1$. On p. 11, sixth line from bottom, $P_i$ should be $P_1$; in the fifth line from bottom, $C(P_1,K)$ should read

$$C(P_1,K) = \frac{1}{2}\sum_{n=1}^K \log\left[\frac{\sum_1^K \lambda_i + P_1 + K}{K(1 + \lambda_n)}\right] + \frac{1}{2}(P - P_1).$$

These are all typing errata and do not affect the development in [2].

Lemma 3. Suppose that $\{\lambda_n, n \ge 1\}$ is an infinite set, and $P>0$. Then $P + \sum_1^K \lambda_n \ge K\lambda_K$ for all $K \ge 1$ if and only if $P \ge \sum_{n\ge1}(\theta - \lambda_n)$.

Proof. $P + \sum_1^K(\lambda_n - \theta) \ge K(\lambda_K - \theta) \iff P + \sum_1^K \lambda_n \ge K\lambda_K$; the result follows from Lemma 3 of [2].

Lemma 4. Suppose that $S - \theta I$ is strictly negative.

(a) If $P \ge \sum_n(\theta - \lambda_n)$, then

$$C_W(P) = \frac{1}{2}\sum_n \log\left[\frac{1+\theta}{1+\lambda_n}\right] + \frac{1}{2}\,\frac{P + \sum_n(\lambda_n - \theta)}{1+\theta}.$$

The capacity can be attained if and only if $P = \sum_n(\theta - \lambda_n)$. It is then attained by a Gaussian $\mu_{A(X)}$ with covariance operator (2), where $u_n = Ue_n$ and $\tau_n = (\theta - \lambda_n)(1+\lambda_n)^{-1}$ for all $n \ge 1$.

(b) If $P < \sum_n(\theta - \lambda_n)$, then

$$C_W(P) = \frac{1}{2}\sum_{n=1}^K \log\left[\frac{\sum_1^K \lambda_i + P + K}{K(1+\lambda_n)}\right]$$

where $K<\infty$ is the largest integer such that $P + \sum_1^K \lambda_n \ge K\lambda_K$. The capacity can be attained by a Gaussian $\mu_{A(X)}$ with covariance operator (2), where $u_n = Ue_n$ and

$$\tau_n = \frac{\sum_1^K \lambda_i + P + K}{K(1+\lambda_n)} - 1 \quad\text{for } n \le K; \qquad \tau_n = 0 \quad\text{for } n > K.$$

Proof.
(a). The fact that

$$2C_W(P) \ge \sum_{n\ge1} \log\left[\frac{1+\theta}{1+\lambda_n}\right] + \frac{P + \sum_{m\ge1}(\lambda_m - \theta)}{1+\theta} \tag{*}$$

follows from (b-iii) of Theorem 1, letting $M \to \infty$ in that result. To prove the reverse inequality, one repeats the proof of part (a) of Lemma 4 in [2], substituting the RHS of (*) above for the RHS of (*) in [2, p. 11]. For this, one uses the fact that

$$\sum_{n=1}^M \log\left[\frac{\sum_1^M \lambda_i + P + M}{M(1+\lambda_n)}\right] = \sum_{n=1}^M \log\left[\frac{1+\theta}{1+\lambda_n}\right] + M\log\left[1 + \frac{\sum_{i=1}^M(\lambda_i - \theta) + P}{M(1+\theta)}\right].$$

The result on attaining capacity is proved in exactly the same way as the corresponding result in [2], again after substituting the RHS of (*) above for the RHS of (*) in [2, p. 11].

(b).

$$C_W(P) \ge \frac{1}{2}\sum_{n=1}^K \log\left[\frac{\sum_{i=1}^K \lambda_i + P + K}{K(1+\lambda_n)}\right] \tag{**}$$

follows from (b-ii) of Theorem 1. Suppose that $C_W(P) >$ RHS(**). Then by (5) there exists a c.o.n. set $\{v_n, n \ge 1\}$ and a sequence $(x_n^2)$ with an infinite number of non-zero terms (using (b-ii) of Theorem 1) such that

$$\text{RHS(**)} < \frac{1}{2}\sum_n \log[1 + x_n^2(1 + \langle Sv_n, v_n\rangle_2)^{-1}] \tag{***}$$

with $\sum_n x_n^2 \le P$. As $\sum_n x_n^2 < \infty$ and $(1 + \langle Sv_n, v_n\rangle_2)$ is a bounded sequence, (***) implies that for some $M<\infty$, RHS(**) $< \frac{1}{2}\sum_{n=1}^M \log[1 + x_n^2(1 + \langle Sv_n, v_n\rangle_2)^{-1}]$. This contradicts (b-ii) of Theorem 1.

□
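The algebraic identity used in part (a) of the proof of Lemma 4 can be confirmed numerically. The sketch below evaluates both sides for one set of assumed values; the helper names and the sample numbers are hypothetical, and natural logarithms are used throughout.

```python
import math

def identity_lhs(lams, P):
    """sum_n log[(sum_i lam_i + P + M) / (M (1 + lam_n))]."""
    M, s = len(lams), sum(lams)
    return sum(math.log((s + P + M) / (M * (1 + l))) for l in lams)

def identity_rhs(lams, theta, P):
    """sum_n log[(1+theta)/(1+lam_n)] + M log[1 + (sum_i (lam_i - theta) + P) / (M (1+theta))]."""
    M = len(lams)
    first = sum(math.log((1 + theta) / (1 + l)) for l in lams)
    second = M * math.log(1 + (sum(l - theta for l in lams) + P) / (M * (1 + theta)))
    return first + second

lams = [-0.6, -0.3, -0.1]   # hypothetical eigenvalues, each > -1
theta, P = 0.2, 0.7         # hypothetical theta > -1 and energy P > 0

print(identity_lhs(lams, P), identity_rhs(lams, theta, P))
```

The left side does not depend on $\theta$, so the right side must give the same value for every admissible $\theta$; this is the step that lets the $\theta = 0$ argument of [2] carry over.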
Lemma 5. If $S \ge \theta I$, then $C_W(P) = \dfrac{P}{2(1+\theta)}$.

Proof. $C_W(P) \ge P(1+\theta)^{-1}2^{-1}$ follows from part (b-i) of Theorem 1, by letting $M \to \infty$. To prove the reverse inequality, one notes that for the constraint $E_{\mu_X}\|R_N^{-1/2}A(X)\|_2^2 \le (1+\theta)^{-1}P$, the capacity ($C_N[(1+\theta)^{-1}P]$) is $P(1+\theta)^{-1}2^{-1}$ [1, Theorem 2]. Since $E_{\mu_X}\|R_W^{-1/2}A(X)\|_2^2 \le P$ implies $E_{\mu_X}\|R_N^{-1/2}A(X)\|_2^2 \le (1+\theta)^{-1}P$, optimization w.r.t. the former constraint is over a smaller set than w.r.t. the latter constraint; thus $C_W(P) \le C_N[(1+\theta)^{-1}P]$.

□


Comparisons of $C_W(P)$ and $C_N(P)$

For the finite-dimensional channel, the capacity $C_W(P)$ given in Theorem 1(a) is strictly greater than $C_N(P)$ ($= \frac{M}{2}\log[1+P/M]$) if $\beta_M < 0$, or if $P + \sum_1^K \beta_i < 0$. $C_W(P) < C_N(P)$ if $0 < \beta_1 \le \beta_M$. The verification is omitted.

For the infinite-dimensional channel, a general statement can be made if $\{\lambda_n, n \ge 1\}$ is empty. Then, $C_W(P) > C_N(P)$ if $\theta<0$, $C_W(P) < C_N(P)$ if $\theta>0$, $C_W(P) = C_N(P)$ if $\theta=0$; see Theorem 1 (b-i) and Theorem 2 (c). Note that $C_N(P) = P/2$ for the unconstrained channel [1, Theorem 2].

If $\{\lambda_n, n \ge 1\}$ is not empty, then for the unconstrained channel the value of $C_W(P)$ given in Theorem 2(a) is greater than $\frac{P}{2(1+\theta)}$, using $\log x > 1 - x^{-1}$. This inequality can also be shown for the value given in Theorem 2(b), proceeding as in the proof of part (b) of the Theorem in [2]. Thus, for the unconstrained channel, $C_W(P) > C_N(P)$ if $\theta \le 0$ and $\{\lambda_n, n \ge 1\}$ is not empty. A similar result can be obtained for the constrained channel.
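The ordering stated above for the case where $\{\lambda_n, n \ge 1\}$ is empty can be illustrated with the closed forms of Theorem 1 (b-i) and Theorem 2 (c). The sketch below uses hypothetical function names and sample values, with capacities in natural-log units.

```python
import math

def c_w_dim_limited(P, M, theta):
    """Theorem 1 (b-i): C_W(P) = (M/2) log[1 + P / (M (1 + theta))]."""
    return 0.5 * M * math.log(1 + P / (M * (1 + theta)))

def c_n_dim_limited(P, M):
    """Matched channel with support dimension <= M: (M/2) log(1 + P/M)."""
    return 0.5 * M * math.log(1 + P / M)

def c_w_unconstrained(P, theta):
    """Theorem 2 (c): C_W(P) = P / (2 (1 + theta))."""
    return P / (2 * (1 + theta))

P, M = 1.0, 4
for theta in (-0.5, 0.0, 0.5):
    print(theta, c_w_dim_limited(P, M, theta), c_n_dim_limited(P, M))

# As M grows, the dimension-limited value approaches the unconstrained one.
print(c_w_dim_limited(P, 10**6, 0.25), c_w_unconstrained(P, 0.25))
```

The mismatched capacity exceeds the matched one for $\theta < 0$, equals it for $\theta = 0$, and falls below it for $\theta > 0$, in line with the statement in the text.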


Discussion.

The mismatched channel differs from the matched channel in several ways. First, the value of the capacity can be very different, as already seen. Secondly, the problem of attaining capacity is much more significant. Even in the finite-dimensional channel the vectors $u_1, \dots, u_K$ must be a specific set of vectors, not just any o.n. set. If $H_2$ is infinite-dimensional with $\dim[\mathrm{supp}(\mu_{A(X)})] \le M$, the situation is even worse in (b-iii) of Theorem 1. That is, capacity can then be attained only if $S$ has $\theta$ as an eigenvalue of multiplicity $\ge M$ when $S \ge \theta I$, or of multiplicity $\ge M-K$ when $S$ has $K < M$ eigenvalues $\lambda_1 \le \dots \le \lambda_K$ strictly less than $\theta$ and $P + \sum_1^K \lambda_i \ge K\lambda_K$. Otherwise, in order to approach capacity, one will need to put part of the available "energy" $P$ in elements $(Ue_n)$ where $(e_n)$ are eigenvectors of $S$ corresponding to successively smaller eigenvalues. In practical applications, this usually corresponds to eigenfunctions at higher and higher frequencies.

For the infinite-dimensional channel without a constraint on $\dim[\mathrm{supp}(\mu_{A(X)})]$, again there can be significant differences between $C_W(P)$ and $C_N(P)$, depending on $\{\theta; \lambda_n, n \ge 1\}$. However, in this case one sees a rather different situation in the problem of attaining capacity. $C_N(P)$ can never be attained; $C_W(P)$ can be attained if and only if $\{\lambda_n, n \ge 1\}$ is not empty and $P \le \sum_n(\theta - \lambda_n)$.

It may be noted that the results given in (a) and (b-ii) of Theorem 1 are similar to those obtained in [4, p. 170], although the developments are quite different. However, those previous results are given in terms of a constraint on $E\|A(X)\|_2^2$, and assume that the noise variance components can be arranged in ascending order. This can only be done if the channel is finite-dimensional. In that case, one can take $R_W = I$, the identity, and thereby use a true power constraint. The assumption (A-1) becomes $R_N = I + S$, and the capacity is as given in (a); this agrees with the referenced results in [4].

